Survey on Self Adaptive Semantic Focused Crawling Using Ontology Learning

Author

  • S. Bhargavi
Abstract

The Internet has become a vast storehouse of knowledge and an excellent source of information for people with varied interests. Information retrieval on the Web nevertheless has shortcomings, namely heterogeneity, ubiquity, and ambiguity. A self-adaptive semantic focused crawler (SASF crawler) is therefore proposed that addresses these issues and optimizes the discovery and indexing of the retrieved information. The framework combines semantic focused crawling with ontology learning, which helps maintain the crawler's performance despite the variety of the Web environment. Its innovations are unsupervised vocabulary-based ontology learning and a hybrid matching algorithm that matches semantically relevant concepts and metadata. Finally, the performance of the crawler is evaluated on various parameters.

Index Terms: Ontology learning, Semantic focused crawling, Hybrid matching algorithm, SASF crawler, Information

Similar resources

Ontology Based Approach for Services Information Discovery using Hybrid Self Adaptive Semantic Focused Crawler

Focused crawling aims to seek out pages that are relevant to a predefined set of topics. Since an ontology is a well-structured representation of information, ontology-based focused crawling methods have come under exploration. Crawling is one of the essential mechanisms for building information repositories. The reason for semantic focused crawler is naturally finding, comme...

Prioritize the ordering of URL queue in Focused crawler

The enormous growth of the World Wide Web in recent years has made it necessary to perform resource discovery efficiently. For a crawler, downloading only domain-specific web pages is not a simple task, and an unfocused approach often returns undesired results. Several new ideas have therefore been proposed; a key technique among them is focused crawling, which is able to crawl particular topical...

Mining Service Using Ontology Learning

It is well recognized that the Internet has become the largest marketplace in the world, and online advertising is very popular with numerous industries, including the traditional mining service industry where mining service advertisements are effective carriers of mining service information. However, service users may encounter three major issues – heterogeneity, ubiquity, and ambiguity, when ...

Semantic Focused Crawling for Retrieving E-Commerce Information

Focused crawling is proposed to selectively seek out pages that are relevant to a predefined set of topics without downloading all pages of the Web. With the rapid growth of e-commerce, how to discover specific information (about buyers, sellers, products, etc.) suited to the online business user has become a central issue for information search engines. We present a novel sema...

Ontology-Focused Crawling of Web Documents and RDF-based Metadata

The enormous growth of the World Wide Web in recent years has made it important to develop document discovery mechanisms based on intelligent and focused crawling techniques. The next-generation Web, the Semantic Web, that is currently being developed as a meta Web, building on the existing one, changes the classical crawling task. Metadata that is based on ontologies will exist in the form of ...

Publication date: 2015